Hierarchical Protein Structure Superposition Using Both Secondary Structure and Atomic Representations

نویسندگان

  • Amit Pal Singh
  • Douglas L. Brutlag
چکیده

The structural comparison of proteins has become increasingly important as a means to identify protein motifs and fold families. In this paper we present a new algorithm for the comparison of proteins based on a hierarchy of structural representations, from the secondary structure level to the atomic level. Our technique represents alpha-helices and beta-strands as vectors and uses a set of seven scoring functions to compare pairs of vectors from different proteins. The scores obtained are used in a dynamic programming algorithm that finds the best local alignment of the two sets of vectors. The second step in our algorithm is based on the atomic coordinates of the protein structures and improves the initial vector alignment by iteratively minimizing the RMSD between pairs of nearest atoms from the two proteins. We refine the final alignment by determining a core of well aligned atoms and minimizing the RMSD of this core. In a comparison of our method to Holm and Sander's DALI algorithm, our program was able to detect structural similarity at the same level as DALI. We also performed searches of a representative set of the Protein Data Bank (PDB) using our program and detected structurally similarity between several distantly related proteins.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An investigation of neutron direct damages at energies of 0.1-2 MeV on the DNA molecules with atomic structure deduced using Geant4 toolkit

This study proposes a method to estimate RBE of fast neutrons using Monte Carlo simulations. This approach is based on the combination of an atomic resolution DNA geometrical model and Monte Carlo simulations for tracking particles. Atomic positions were extracted from the Protein Data Bank. The GEANT4 code was used for tracking the secondary particles generated by fast neutrons during their in...

متن کامل

Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...

متن کامل

Challenges in Protein Structure Prediction and Drug Discovery

Alignment or superposition of multiple flexible ligands in 3D is a key step in rational ligand-based drug design, pharmacophore elucidation and 3D QSAR analysis. We have recently introduced Atomic Property Fields methodology, which utilizes continuous Gaussian-based multicomponent potentials to represent the distributions of physico-chemical atomic properties. Calculation of APF pseudo-energy p...

متن کامل

FTIR Investigation of Secondary Structure of Reteplase Inclusion Bodies Produced in Escherichia coli in Terms of Urea Concentration

Recent studies suggest that reducing the induction temperature would improve the quality of some recombinant inclusion bodies (IB) by providing a native-like secondary structure and leading to an improvement in protein recovery. This study focused on optimizing the solubilization condition of Reteplase, a recombinant protein with 9 disulfide bonds. The influence of lowering induction temperatur...

متن کامل

FTIR Investigation of Secondary Structure of Reteplase Inclusion Bodies Produced in Escherichia coli in Terms of Urea Concentration

Recent studies suggest that reducing the induction temperature would improve the quality of some recombinant inclusion bodies (IB) by providing a native-like secondary structure and leading to an improvement in protein recovery. This study focused on optimizing the solubilization condition of Reteplase, a recombinant protein with 9 disulfide bonds. The influence of lowering induction temperatur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Proceedings. International Conference on Intelligent Systems for Molecular Biology

دوره 5  شماره 

صفحات  -

تاریخ انتشار 1997